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The  First  Annual  Report  for  Project,  Titled: 
Implementation  of  Computer  Assisted  Breast  Cancer  Diagnosis 
(US  Army  Grant  No.  DAMD17-93-J-3007) 


1.  Introduction 

Many  investigators  have  attempted  to  analyze  mammographic  abnormalities.  Recently,  several 
investigators  have  proposed  various  methods  for  the  automatic  detection  of  microcalcifications  and 
masses  on  mammograms.  Vast  improvements  in  accuracy  have  been  made  since  the  initial  attempt  [Chan 
1987;  1988]  to  apply  the  computer  algorithms  for  the  detection  of  microcalcifications.  We  believe  that  it 
is  important  to  implement  the  program  into  a  high  speed  workstation  and  conduct  a  large  scale  preclinical 
trial  in  order  to  evaluate  its  clinical  practicability  and  limitations.  The  false-positive  rate  for  the  detection 
of  masses  is  still  very  high.  On  the  other  hand,  we  believe  that  the  creation  of  a  computer  program  to 
analyze  features  of  suspected  masses  will  give  rise  to  a  more  useful  and  fundamental  approach  to 
computer-aided  diagnosis.  We  propose  to  use  an  artificial  neural  network  to  classify  malignant  and 
benign  masses. 

With  the  use  of  high-resolution  digital  mammograms,  data  compression  is  an  important  means  to 
facilitate  the  mammographic  image  transmission  and  storage.  We  have  studied  some  characteristics  of 
the  mammograms  using  gray  value  splitting  and  full-frame  discrete  cosine  transform  (DCT)  methods. 
Effects  of  applying  the  data  compression  to  the  proposed  computer  aided  diagnosis  (CADx)  scheme  in 
the  detection  of  microcalcifications  were  also  tested  in  our  preliminary  evaluation. 

2.  Refinement  of  Image  Processing  Prior  to  the  Detection  of  Microcalcifications 

At  Georgetown,  we  have  developed  a  band-passed  filter  based  on  the  wavelet  transform  domain. 
Since  the  wavelet  transform  can  decompose  both  frequency  and  local  spatial  information  into  its 
transform  domain,  some  breast  tissue  structures  (e.g.,  vessels  and  ducts,  etc.)  can  be  easily  extracted  in 
the  transform  domain. 

In  the  wavelet  transform,  many  line  structures  were  extracted  according  to  three  different  high 
frequency  regions,  namely;  horizontal,  vertical,  and  diagonal.  In  these  regions,  one  can  easily  detect  the 
line  and  band  structures  using  modified  Hough  Transform.  Removed  lines  and  bands  are  compensated 
by  relaxation  algorithms  in  the  original  image.  Once  these  background  structures  are  reduced, 
microcalcifications  can  be  extracted  much  more  accurately  by  the  following  CADx  detection  procedure. 
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We  also  spent  a  great  deal  of  effort  towards  the  improvement  of  the  original  CADx  program. 
The  microcalcification  searching  algorithm,  previously  bottlenecked ,  has  been  greatly  improved.  The 
new  program  uses  ’’the  chain  algorithm"  to  search  for  the  boundary  of  each  island  based  on  a  given 
threshold.  The  suspected  microcalcifications,  which  are  "islands"  in  a  large  image,  are  tested  by 
histogram  thresholding  and  root-mean-square  variation  methods. 


4.  Mammo graphic  Image  Compression. 

Based  on  the  splitting  method  which  was  employed  to  reduce  edge  effects  and  to  obtain  maximum 
compression  efficiency  [Lo  1991],  we  have  refined  our  compression  method  using  alternate  value 
contour  coding  and  full-frame  entropy  encoding.  Trade-off  studies  between  irrerversible  and  error-free 
compression  were  also  tested  on  selected  chest  and  mammographic  images  [Lo  1993].  We  have  tested 
15  digitized  mammograms  based  on  a  refined  compression  scheme.  The  results  are  reported  below: 

.1.  Error-Free  Compression  for  Images  Containing  Most  Significant  Values 

We  have  developed  an  efficient  compression  method  called  "alternate  value  contour  coding”  for  the 
step-type  image.  The  most  significant  value  images  containing  the  3  most  significant  bits  (3MSB)  of 
digital  radiographs  belong  to  this  type  of  image.  We  have  tested  the  newly  developed  compression 
method  on  chest  radiographs.  This  method  performed  much  better  for  error-free  compression  than 
DPCMfrun-zero/arithmetic  coding  proposed  earlier  due  to  the  method  of  turning  the  entire  2-D  image  data 
into  a  1-D  edge  tracking  sequence.  In  addition,  contouring  far  the  adjacent  values  is  ignored  and  the 
image  data  is  fully  recoverable.  The  only  drawback  of  the  alternate  value  contour  coding  is  that  the 
algorithm  is  somewhat  complicated  and  demands  error  checking  procedures  to  ensure  the  error-free 
requirement  is  fulfilled. 

.2.  Compression  for  Images  Containing  Least  Significant  Value 

Based  on  full-frame  entropy  encoding  (FFEQ,  the  remapped  9  least  significant  value  (R9LSB) 
images  were  decomposed  by  2-D  full-frame  DCT  followed  by  a  quantization  procedure  and  an  entropy 
coding  (arithmetic  encoding)  as  indicated  in  our  proposal.  Preliminary  results  obtained  from  the  studies 
using  the  proposed  methods  are: 
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4.2.1.  Quantization  -  we  have  evaluated  the  density  distribution  of  DCT  coefficients  for  several  chest 
R9LS  V  images.  We  found  that  the  distribution  density  of  R9LSV  image  can  be  modelled  by  a  Gaussian 
function.  However,  the  standard  deviation  of  the  Gaussian  distribution  tends  to  be  large  which  makes 
non-linear  quantization  [Max  1960,  Modestino  1985]  less  useful  The  reason  for  a  broad  Gaussian 
distribution  is  that  the  low  bit  data  contains  a  much  lower  signal  to  noise  ratio.  It  is  relatively  difficult  to 
quantize  and  to  encode  noise  dominated  images  both  in  the  spatial  and  in  the  frequency  domain.  Our 
initial  results  indicated  that  the  advantage  of  using  a  non-linear  quantizer  over  a  linear  quantizer  is  very 
small  (about  5-8%)  for  R9LSV  images. 

4.2.2  Noise  Evaluation  and  Coding  -  As  far  as  RLSV  images  are  concerned,  the  FFEC  [Lo  1991- 
SPIE]  is  the  primary  algorithm.  We  found  that  digitized  radiographs  contained  not  only  white  noise  but 
also  system  structure  noises  (e.g.,  system  electronic  and  mechanical  noises  and  dust  on  the  computed 
radiographic  plate  or  lens).  Although  the  splitting  method  has  partially  solved  problems  causing  by 
sharp  edges,  spots  and  shallow  lines  are  the  main  structures  for  encoding.  We  therefore  spent  some  time 
overseeing  image  quality  and  consulting  with  vendors  to  adjust  our  computed  radiographic  and  laser  film 
digitizer  systems.  However,  we  did  not  succeed  in  overcoming  all  the  structure  noises,  particularly  in 
film  digitizers.  We  have  evaluated  images  using  a  step  wedge  and  found  that  a  minimum  of  4  of  the  least 
significant  bits  out  of  12-bit  values  are  noise.  These  results  are  confirmed  by  both  signal  to  noise  (S/N) 
and  covariance  studies  with  single  displacement  Among  the  sampled  gray  spectra,  the  maximum  S/N  is 
120.  By  removing  up  to  4  least  significant  bit  data  through  round-off,  the  maximum  covariance  is  less 
than  0.07.  However,  the  test  of  covariance  is  drastically  increased  to  0.2  with  the  5  least  significant  bit 
data.  These  results  indicated  that  the  image  data  contained  only  about  8-bit  information.  Based  on  these 
noise  characteristics,  we  can  limit  our  frequency  quantization  corresponding  to  gray  value  variance  (e.g., 
15). 

4.3.  Image  Compression  in  Digital  Mammography  and  Its  Effects  on  Computerized  Detection  of  Subtle 

Micrecalcifications 

Our  previous  receiver  operating  characteristic  (ROC)  study  indicated  that  the  detection  accuracy 
of  microcalcifications  by  radiologists  is  significantly  reduced  if  mammograms  are  digitized  at  0.1mm. 

Our  recent  study  also  showed  that  detection  accuracy  by  computer  decreases  as  the  pixel  size  increases 
from  0.035mm.  Clearly,  the  digitization  of  mammograms  requires  very  large  matrix  sizes.  Efficient 
compression  techniques  will  be  needed  to  facilitate  communication  and  archiving  of  digital 
mammograms. 

In  this  study,  mammograms  were  digitized  with  a  laser  scanner  at  a  pixel  size  of 0.035  mm  and 
12  bits.  We  studied  two  compression  techniques:  (a)  full  frame  DCT  coding  with  entropy  coding  and 
splitting  of  bits,  and  (b)  Laplacian  pyramid  hierarchical  coding  (LPHC)  with  linear  requantization.  The 
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effectiveness  of  the  techniques  is  compared  in  terms  of  the  bit  rate,  the  mean-square -error,  the  visual 
quality  of  reconstructed  and  error  images,  and  the  detection  of  microcalcifications  by  computer. 

With  LPHC,  significant  degradation  of  detection  accuracy  was  observed  when  the  compression 
ratio  was  greater  than  3.6: 1.  The  DCT  technique  provided  a  higher  compression  efficiency  at 
comparable  detection  accuracy.  A  compression  ratio  of  9.6:1  was  achieved  without  significant 
degradation  in  the  detection  of  microcalcifications.  Furthermore,  it  was  found  that  the  mean-square 
error  was  not  a  good  indicator  for  the  evaluation  of  information  loss  due  to  image  compression. 

In  summary,  our  study  showed  that  there  is  a  trade-off  between  reconstructed  image  quality  and 
compression  efficiency.  Further  investigation  is  needed  for  selection  of  optimal  compression  technique 
for  digital  mammography. 

5.  Implementation  of  CADx  for  the  Detection  of  Clustered  Microcalcifications 

We  have  started  to  implement  the  CADx  program  into  a  DEC  Alpha  workstation  which  is 
currently  the  fastest  workstation  on  the  market  The  basic  user  interface  is  nearly  complete.  However,  it 
requires  some  final  modifications.  The  user  interface  can  select  a  mammogram  and  display  it  on  the 
workstation.  Several  basic  image  functions  are  also  available:  (1)  "window  and  level"  for  the  adjustment 
of  the  brightness  and  contrast  (2)  pan,  and  (3)  a  cursor  box  for  the  user  to  select  the  area  of  interest 

6.  Contractual  (SQWUssucs 

We  have  not  completed  our  research  and  implementation  of  CADx  workstation.  At  this  point  we 
have  contacted  Dr.  R.  V.  Shah,  Chief  Br&t  Radiologist  at  Brook  Army  Medical  Center  and  Dr.  Don 
Smith,  attendant  breast  radiologist  at  Madigan  Army  Medical  Center.  They  have  agreed  to  provide  their 
proven  cases  associated  with  mammographic  microcalcifications  for  inclusion  in  our  test  database 
[Private  Communication].  We  will  provide  our  software  for  the  evaluation  at  Army  Hosipitals  after  the 
end  of  this  project 

7.  Conclusion  of  the  Annual  Report 

At  this  point  we  have  refined  our  CADx  algorithms  in  image  preprocessing,  detection  effectiveness, 
and  computer  speed.  We  have  also  done  studies  in  mammographic  image  compression.  More  studies 
on  the  impact  of  compression  on  the  CADx  are  in  progress.  Database  collection  is  ongoing  and  will 
continue  up  to  the  final  stage  of  this  project  Several  basic  functions  and  user  interface  have  been 
implemented  in  the  workstation. 

During  the  next  year,  we  are  going  to  convert  and  test  some  of  our  FORTRAN  codes  to  C  computer 
language.  It  will  take  great  engineering  efforts  to  merge  our  newly  developed  algorithm  in  C  and  useful 
old  codes  developed  by  Dr.  Chan  and  her  colleagues.  We  will  spend  mostfour  research  time  in  the 
evaluation  of  the  effect  of  digitization  on  accuracy  of  CADx  using  the  proposed  computer  scheme. 
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